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ABSTRACT 

The length of the three-dimensional filaments observed in the fourth public data-release of the 
SDSS is measured using the local skeleton method. It consists in defining the set of points 
where the gradient of the smoothed density field is extremal along its isocontours, with some 
additional constraints on local curvature to probe actual ridges in the galaxy distribution. A 
good fit to the mean filament length per unit volume, £, in the SDSS survey is found to be 
C = (52500 ± 6500) (L/Mpc)-i-^5±o.o6^p(,/(j^00Mpc)3 for 8.2 < L < 16.4 Mpc, where L is the 
smoothing length in Mpc. This result, which deviates only slightly, as expected, from the trivial 
behavior C oc L^^, is in excellent agreement with a ACDM cosmology, as long as the matter density 
parameter remains in the range 0.25 < rimattor < 0.4 at one sigma confidence level, considering the 
universe is flat. These measurements, which are in fact dominated by linear dynamics, are not signif- 
icantly sensitive to observational biases such as redshift distortion, edge effects, incompleteness, and 
biasing between the galaxy distribution and the dark matter distribution. Hence it is argued that the 
local skeleton is a rather promising and discriminating tool for the analysis of filamentary structures 
in three-dimensional galaxy surveys. 

Subject headings: methods: data analysis, statistical — cosmology: large-scale structure 



1. INTRODUCTION 



From the Great Wall of CFAl l|Geller fc Huchra 1989(1 
to the very lo ng filaments s een in the SDS S 
iOott et a,]. 200,5H and the 2DF llColless et al. 20011). 
the ever growing size of the largest structures ob- 
served in the three-dimensional galaxy distribution 
has remained a challenge to models of large scale 
structure formation. It is therefore of prime impor- 
tance to find a robust way to identify filaments in 
the Universe and to characterize them, e.g. through 
their length, thickness and/or average density. To 
achieve that, one usually relies on the analysis of the 
morphological properties -e.g. through structure func- 
tions ( Babul & Starkman 1992), Min kowski functiona l 
l|Mecke et al. 19941|1 . shape finders ( Sah ni et al. 1998H - 
of an excursion's connected components in overdense 
regions of the catalog, wh ich can be obtained using 
friend-of-friend algorithms ijZeFdovi ch ^1^ ^1. 19821). the 
minimum spanning tree technique llBarrowetai. 19851 
iDoroshkevich et al. 2004*) or percolation on a grid 
where the density field has been smoothly interpolated 
lIQott et al. SfitlDominik Shandarin 1991. 

This letter works instead in the framework of Morse 
theory l|Colombi et al. 200(1). and uses the approach pro- 
pose d recently by iNovikov et al.l (2006, hereafter NCD) 
and iSousbie et alJ f2006. hereafter SPCN), where fila- 
ments are seen as a set of special field lines, depart- 
ing from saddle points and converging to local max- 
ima while following the gradient of the density field, 
V/9 = dp/dxi = Pi. However, the skeleton thus de- 
fined remains non-local, which makes analytical calcu- 
lations challenging and edge effects difficult to cope with 
in real catalogs. To solve these issues, a local approxi- 
mation of the skeleton was proposed by NCD in the 2D 
case, and generalized to 3D by SPCN. Given the Hessian, 




Fig. 1. — Derived 3D skeleton in a slice of the SDSS 
for a smoothing length L = 16.4 Mpc. This animation still 
frame shows how well structures are captured, notably the 
SLOAN great wall. The mpeg movie version is available at 
http : //www. pro jet-horizon . f r/articlel73 .html. 



H = d'^p/dxidxj = pij, and its eigenvalues, A^, ranked 
in decreasing order, the local skeleton is defined as the 
set of points where H ■ Vp = XiVp and A2, A3 < 0, to 
ensure that ridges of the density field are probed. 

In this paper, the local skeleton is extracted from the 
SDSS DR4 galaxy catalog. Its total length per unit 
volume is measured and compared to that obtained in 
ACDM cosmologies. Relying on realistic mock catalogs, 
various effects such as incompleteness, survey geometry, 
cosmic variance, redshift distortions, biasing and non- 
linear dynamics, are extensively tested. 

2. OBSERVATIONAL AND MOCK DATA SAMPLES 
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TABLE 1 

Length per unit volume of the skeleton for different SDSS and mock samples 



L DR4-350 DR4-VL350 MOCK MOCK-PS MOCK-AS MOCK-NB 



length density 16.4 372 363 390 ± 19 395± 16 362 403 ± 17 

[Mpc/{100Mpc)3] 10.9 795 772 796 ± 18 815± 19 740 790 ± 24 

8.2 1271 1299 1272 ± 25 1308± 27 1285 1204 ± 25 



A complete description of the Fourth Data Release of 
th e Sloan Digital Sky Survey (DR4 SDSS) can be found 
in lAdelman-McCarthv et al.l (2006). The main sample 
used in this paper is extracted from the Catalog Archive 
Server facility. To ensure proper spectral identification 
of galaxies, objects with specclass = 2 and zconf > 0.35 
are selected in the specphoto table. This yields a main 
sample containing 459, 408 galaxies. The completeness in 
apparent magnitude was investigated and is achieved for 
C^SDSS < 19. Two subsamples were extracted: a sample 
cut at distance d < 350 Mpc containing 148,012 galax- 
ies (hereafter DR4-350), and a homogeneous, volume- 
limited sample (hereafter DR4-VL350) containing 25, 843 
galaxies selected on the basis of their absolute magnitude 
Mabsu < -17 and d < 350 Mpc. 

To test the robustness of the measurements and com- 
pare observational results to theoretical predictions, a 
large ACDM simulation was performed, using the pub- 
hcly available treecode GADGET-2 10), involving 512^ 
particles in a 1024 /i~^Mpc box, and with the follow- 
ing cosmological parameters: Hq = 70 km/s/Mpc, 

^^baryons = 0.05, ilmattcr = 0.3, Q\ = 0.7 and 

normalization erg — 0.92. Various mock catalogs 
were extracted from this simulation, using MoLUSC 
l|Sousbie et al. 2006bD . This tool is designed to build 
realistic mock galaxy catalogs from dark matter simu- 
lations of large volume but poor mass resolution, by re- 
projecting, as functions of local phase-space density, the 
statistical properties of the galaxy distribution (type, 
spectral features, number density, etc.) derived from 
semi-analytic models applied to simulations of higher 
mass resolution . Here, the results calculated by GallCS 
l|Hatton et al. 2 0031 on a treecode simulation with 256 
particles in a cube of 150 Mpc on a side are used as in- 
puts of MoLUSC. According to the analyses of Blaizot et 
al. (2006), this simulation should provide sufficient mass 
resolution to describe realistically the statistical proper- 
ties of SDSS galaxies with Usdss < 19. The advantage 
of using MoLUSC is that it allows one to probe a realis- 
tic volume of the Universe without worrying about finite 
volume or replication effects in the realization of mock 
catalogs themselves (Blaizot ct al. 2005). 

For the purpose of testing the skeleton properties, three 
different kinds of mock catalogs were built, all cut at 
a distance of 350 Mpc: (1) the main catalog is called 
MOCK and attempts to reproduce all the characteris- 
tics of DR4-350 (redshift space distortion, incomplete- 
ness, survey geometry, etc.); (2) MOCK-PS is identical 
to MOCK but uses the exact positions of the galaxies 
to test the effect of redshift distortion; (3) MOCK-AS is 
an all-sky version of MOCK aimed to test the influence 
of survey geometry and finally (4) MOCK-NB is identi- 
cal to MOCK but with dark matter particles (without 



density biasing). The volume of our simulation is ap- 
proximately 30 times that covered by DR4-350, which 
yields an error bar reflecting cosmic variance from the 
dispersion among 25 random realizations of MOCK. Note 
finally that measurements were also performed directly 
on the dark matter distribution simulation boxes and on 
the initial conditions of the simulation, to test the effects 
of nonlinear clustering. 

3. THE 3D SKELETON: ALGORITHM 

The details of the algorithm used to draw the local 
skeleton defined in §nare given in SPCN, so only a brief 
sketch of it is given here: 

(i) interpolation and smoothing: the first step 
consists in performing cloud-in-cell interpolation 
(jHocknev fc Eastwood 19811) on a 512 grid covering a 
700 Mpc cube embedding the survey. To avoid extra 
degeneracies while drawing the skeleton, the empty 
regions of the cube are filled with a random distribution 
of galaxies with 1, 000 times smaller average density than 
inside the survey. At the end of the process, only the 
parts of the skeleton belonging to the original survey are 
kept. To warrant sufficient differentiability, convolution 
with a Gaussian window of size L is performed prior 
to computing the gradient and the Hessian using a 
finite difference method. As argued in NCD, in order 
to avoid contamination by the grid, discreteness and 
finite volume effects, respectively, the smoothing scale 
should verify i/A > 4, L/A > 1 and L/V^/^ < 20 where 
A is the grid step, A the mean interparticle distance 
and V is the survey volume. As a result, the following 
conservative scale range, 8.2 < L < 16.4 Mpc, will be 
used for the measurements performed in this paper. 

(ii) surface intersection modeling: the second step of 
the algorithm consists in drawing the skeleton, noting 
that it is embedded in the set of points verifying TC-Wpx 
Vp = 0. This leads to 3 conditions, Si{xi,X2,X3) — 0, 
that define 3 surfaces intersecting along a common line. 
The actual method used to compute the surfaces Si = 
as an assemblage of triangles and their intersections as 
a set of connect ed lines relies on the cl assical marching 
cube algorithm ijLorensen fc Cline 1987(1 . as detailed fur- 
ther in SPCN. Additional conditions, namely that the 
gradient should be aligned with the major axis of cur- 
vature - in practice, \Vp.Ui\ > max (| Vp.U2|, iVp.Uaj) 
where are the eigenvectors of H- and A2 , A3 < are 
enforced locally after diagonalizing the Hessian. 

(iii) cleaning: some additional treatment has to be per- 
formed in regions where the field becomes degenerate 
{e.g. in the vicinity of critical points, Vp — 0), as ex- 
plained in detail in SPCN. Finally, the parts of the local 
skeleton which do not pass through any critical point are 
removed. As argued in SPCN, these parts are mostly 



3 



irrelevant as they do not, in general, correspond to real 
filaments. 

4. MEASUREMENTS AND ROBUSTNESS VS 
OBSERVATIONAL BIASES 

Figure 121 shows the skeleton measured in DR4-350 for 
3 smoothing scales L — 16.4, 10.9 and 8.2 Mpc, (top 
left and 2 bottom panels) while the measurements of its 
length, £ as a function of L are summarized in Tabled 

As expected, the skeleton matches the intuitive vi- 
sual definition of what a filament is, and its length and 
complexity increase with the inverse of L. Notice on 
Fig.|21that the prominent features of the skeleton remain 
mostly independent of smoothing: decreasing L essen- 
tially adds new branches to the skeleton, corresponding 
to finer structures in the galaxy distribution. In other 
words, the skeleton grows like a tree, while L decreases. 
The overall scale dependence of the measured length, 
C = (52500 ± 6500)(L/Mpc)-i-^5±o.o6j^pg/(;L00Mpc)3, 
is in good qualitative agreement with the expected trivial 
power- law in the scale- free case, £ (x (SPCN). 

These results match very well the predictions of the 
standard ACDM model (compare MOCK to DR4-350). 
This allows one to use the mock catalogs as a solid base- 
line to test possible observational and dynamical effects 
on the skeleton, as discussed now, using Table H as a 
guideline. Incompleteness and discreteness effects can 
simultaneously be tested by comparing DR4-350 to its 
volume-limited counterpart, DR4-VL350, which probes 
only 15 percent of the galaxies available in DR4-350 (see 
the top panels of Fig. [21). They have little impact on the 
skeleton, changing its length by at most 3 percent. Edge 
effects arise from the particular geometry of the SDSS. 
They can be probed by comparing MOCK to MOCK-AS. 
They have a small but systematic impact on the mea- 
sured length of the skeleton, which is increasingly overes- 
timated with scale, from about 1 percent for L = 8.2 Mpc 
to 8 percent for L — 16.4 Mpc. In terms of scaling be- 
havior, C cx a is therefore slightly underestimated, 
which explains partly the slight deviation from the ex- 
pectation a = 2, in addition to the scale dependence 
of the power-spectrum of the density fiuctuations. Cos- 
mic variance should be small: when estimated from the 
dispersion among 25 realizations of MOCK, it increases 
with smoothing scale, as expected, from a 2 percent er- 
ror for L = 8.2 and L = 10.9 to a 5 percent error for 
L = 16.4. Redshift distortion effects, discussed at length 
in SPCN, can be tested by comparing MOCK to MOCK- 
PS. They have negligible impact on the measurements, 
well within the cosmic variance. Finally, since the skele- 
ton probes overdense regions of the universe and large 
smoothing scales are considered, the measurements are 
expected to be rather insensitive to effects of biasing and 
to be dominated by the predictions of linear dynamics. 
This has been fully confirmed when comparing the skele- 
ton of simulations at z = and in the initial conditions, 
at least in the ACDM cosmogony framework. Moreover, 
the small amplitude of the change of length of the skele- 
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ton when nonlinear biasing is applied can be checked in 
Tabic CI (compare MOCK-NB and MOCK). 



5. DISCUSSION: A TEST OF LSS FORMATION MODELS 

In this letter a method to probe the filamentary struc- 
ture in the galaxy distribution, involving the extraction 
of the local skeleton from the data and measuring its 
length per unit volume, C, was tested on the SDSS and 
mock catalogs. The length of the skeleton was found to 
be a robust statistic in the scaling regime 8.2 < L < 16.4 
Mpc, rather insensitive to nonlinearities, biasing, redshift 
distortion, incompleteness and cosmic variance. The re- 
sults were however slightly affected by edge effects due to 
the geometry of the SDSS (see Table 1 Column 6). StiU, 
one observes an excellent agreement with the ACDM con- 
cordant model. (See figure 3, top panel). 

One question remains: is the length of the skeleton a 
discriminant measure of large scale structure? In the- 
ory, the answer is positive: for a Gaussian field, C de- 
pends on the shape of the power-spectrum of density 
fluctuations, P{k), through its moments of order 2m, 
/ fc2™+2p(fc) exp(-fc2L2)dfc, up to TO = 3, leading to the 
approximate scaling C cx (6.2 -I- n)L~^ for P{k) oc fc" 
(SPCN). To demonstrate that this spectral dependence 
can be used to constrain models of large scale structure 
in practice, nine flat universe simulations were carried 
out with GADGET-2, involving 256^ particles and with 
the same cosmological parameters as previously used ex- 
cept that r^mattor was left as a free variable in the range 
0.1 < r^mattor < 0.9. From each of the simulations, 25 
mock catalogs were extracted, in which C was estimated. 
These measurements were used to perform standard 
analysis to find the best matching value of rJmattor for the 
SDSS, using MOCK as reference. The final la constraint 
is 0.25 < rJmatter < 0.4 (See figure 3). 

This clearly demonstrates that the length of the skele- 
ton is a discriminant estimator, 

which might prove to be a real alternative to traditional 
two-point statistics estimators which are extremely sen- 
sitive to the bias in the nonlinear stage of gravitational 
instability. The local skeleton extraction also opens new 
paths of investigation for the structure analysis of galac- 
tic or dark matter distribution, with the prospect of 
defining quantitatively the locus of filaments. In par- 
ticular, it will allow astronomers to carry measurements 
(velocity, pressure...) along th e main motorways of galac- 
tic infaU ijAubert et al. 2004|) . 
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Fig. 2. — The skeleton of the SDSS compared to the corresponding galaxy distribution. Top left and right panels: the galaxy distribution, 
respectively in DR4-350 and its volume-limited counterpart, DR4-VL350 with the superimposed skeleton measured for a smoothing scale 
L = 16.4 Mpc. Bottom left and right panels: the skeleton measured in DR4-350, for L = 10.9 and 8.2 Mpc, respectively. To have a better 
feeling of what the results look like in 3D, a mpeg movie is also available in the electronic edition of the Astrophysical Journal, where the 
skeleton is measured on the full SSDS-DR4 data for L = 16.4 Mpc. 
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Fig. 3. — Top panel: skeleton length per unit volume (in 
Mpc/(100 Mpc)"^) as a function of smoothing length for mock 
catalogs and SDSS showing the very good agreement (as ex- 
plained in the text). The error bars correspond to the cosmic 
variance, which is estimated via 25 realizations of the mock. 
Bottom panel: corresponding to the squared difference be- 
tween the total length per unit volume of the mock catalog and 
that of the SDSS, in units of the RMS of the mock; These 
curves yield a confidence interval for Omatter (for a flat universe) 
of [0.25,0.4] at one-cr level and [0.15,0.75] at two-o- level. The 
simulations are 256"^ dark matter particles of a given iTo = 70 
km/s/Mpc using an Eke prescription 0) for the normalization 
of the spectrum. 



